Revisiting the CAI from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models
Authors
Abstract
Highly expressed genes in many bacteria and small eukaryotes often have a strong compositional bias in terms of codon usage. Two widely used numerical indices, the codon adaptation index (CAI) and the codon usage, use this bias to predict the expression level of genes. Both indices rest on fairly simple assumptions, made when they were first derived, about which genes are most highly expressed: the CAI was originally based on the codon composition of a set of only 24 highly expressed genes, and the codon usage on assumptions about which functional classes of genes are highly expressed in fast-growing bacteria. Given the recent advent of genome-wide expression data, we should be able to improve on these assumptions. Here, we measure, in yeast, the degree to which consideration of the current genome-wide expression datasets improves the performance of both numerical indices. We find that changing the parameterization of each model can somewhat improve its correlation with actual expression levels, although both indices are fairly insensitive to the exact way they are parameterized. This insensitivity indicates a consistent codon bias amongst highly expressed genes. We also attempt direct linear regression of codon composition against genome-wide expression levels (and protein abundance data). This has some similarity with the CAI formalism and yields an alternative model for the prediction of expression levels based on the coding sequences of genes. More information is at
Similar resources
Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models.
Highly expressed genes in many bacteria and small eukaryotes often have a strong compositional bias, in terms of codon usage. Two widely used numerical indices, the codon adaptation index (CAI) and the codon usage, use this bias to predict the expression level of genes. When these indices were first introduced, they were based on fairly simple assumptions about which genes are most highly expre...
Identification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene
Background: Little is known about the synonymous codon usage pattern of the pseudorabies virus (PRV) genome, especially that of the UL31 gene, over the course of its evolution. Objectives: In the present study, the codon usage bias between the PRV UL31 sequence and UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...
Codon bias patterns in photosynthetic genes of halophytic grass Aeluropus littoralis
Codon bias refers to differences in the frequency of occurrence of synonymous codons in coding DNA. Patterns of codon usage and optimal codon utilization differ significantly between organisms. This difference results from the long-term action of natural selection and evolution. Genetic drift, mutation and regulation of gene expression are the main reasons for codon bias. In this stu...
Codon optimization and cloning of bovine prochymosin gene for proper expression in tobacco plant
Bovine chymosin is one of the most commonly used enzymes in the dairy industry. Production of this enzyme from its natural source does not meet the needs of this huge industry. Production of recombinant bovine chymosin in plants can be a good alternative to the native enzyme. Insertion and expression of foreign genes in plants can occur in the nucleus and in chloroplasts. The ...
Enhancing the operational properties of the endoglucanase enzyme through amino acid substitution
Background & Aims: Ethanol produced from plant cellulose is called bioethanol and is recognized as a unique sustainable liquid fuel with powerful economic and environmental effects. In the present study we aimed to integrate a cellulase gene into the yeast genome so that the enzyme is secreted out of the cell. Cellulose is subsequently degraded to glucose by the enzyme, and then it is ferment ...